10 



- 52 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Bujard, Hermann 
Gossen, Manfred 
Sal f eld Jochcn ^ 
Voss, Jeffrey W. 

(ii) TITLE OF INVENTION: Animals Transgenic for a Tetracycline- 

Controlled Transcriptional Transactivator 

<iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Lahive & Cockfield 

15 * (B) STREET: 60 State Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP : 02109-1875 

20 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: ASCII text 

25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

30 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/383,754 

(B) FILING DAE: 14-JUN-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/076,327 
35 (B) FILING DAE: 14-JUN-1993 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A . Jr. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE /DOCKET NUMBER: BBI-013CP2 



40 



ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 



4 5 INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS 



- 53 - 



(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Herpes Simplex Virus 
10 (B) STRAIN: K12 , KOS 

(vii) IMMEDIATE SOURCE 

(B) CLONE: tTA transact i va tor 

!5 (ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 1..1008 

(ix) FEATURE: 
20 (A) NAME /KEY : mRNA 

(B) LOCATION : 1 . .1008 

(ix) FEATURE: 

(A) NAME /KEY : misc. binding 
25 (B) LOCATION: 1..207 

(ix) FEATURE: 

(A) NAME /KEY : misc. binding 

(B) LOCATION: 208.. 335 

30 

(ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1005 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 
Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 

15 



45 



55 



1 5 10 



4 0 CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 
Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 

20 25 30 



AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 

35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGO CAC CAT 

Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 



50 50 55 



60 



ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 

80 



65 70 7 5 



AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Glv 
85 



48 



96 



144 



19: 



240 



288 



15 



25 



30 



35 



40 



45 



50 



55 



GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 



CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 

120 125 

10 AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 14Q 



GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 
Val Leu Glu As P Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 



CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 

175 



20 165 170 



TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 19Q 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC GCG 
He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

TAC AGC CGC GCG CGT ACG AAA AAC AAT TAC GGG TCT ACC ATC GAG GGC 
Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr He Glu Gly 
210 215 220 



CTG CTC GAT CTC CCG GAC GAC GAC GCC CCC GAA GAG GCG GGG CTG GCG 
Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 

240 



225 230 235 



GCT CCG CGC CTG TCC TTT CTC CCC GCG GGA CAC ACG CGC AGA CTG TCG 
Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 

ACG GCC CCC CCG ACC GAT GTC AGC CTG GGG GAC GAG CTC CAC TTA GAC 
Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
260 265 270 

GGC GAG GAC GTG GCG ATG GCG CAT GCC GAC GCG CTA GAC GAT TTC GAT 
Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 28 o 285 

CTG GAC ATG TTG GGG GAC GGG GAT TCC CCG GGT CCG GGA TTT ACC CCC 
Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 

CAC GAC TCC GCC CCC TAC GGC GCT CTG GAT ATG GCC GAC TTC GAG TTT 

His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 

305 n ttc 

JiU 31 5 320 

GAG CAG ATG TTT ACC GAT CCC CTT GGA ATT GAC GAG TAC GGT GGG TAG 
Glu Gin Met Phe Thr Asp Pro Leu Gly He Asp Glu Tyr Gly Gly 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



325 330 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 33 5 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
10 Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser 



335 



Ala Leu Glu Leu 



5 10 15 



15 



Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 

20 25 3 0 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 

7 R a r\ 



Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
20 50 55 60 



Thr His Phe Cys Pro Leu Glu Gly Gxu Ser Trp Gin Asp Phe Leu Arg 

80 



65 70 75 



25 Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 

85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cvs 
130 135 140 



30 



35 



Val Leu Glu Asp Gin Glu His Gin Val Al 



a Lys Glu Glu Arg Glu Thr 

145 150 1 rr 

155 X60 

4 0 Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 

165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

4 5 

He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr He Glu Glv 

50 21° 215 220 

Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 

225 230 235 240 

55 Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 



- 56 

245 250 255 

Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
260 265 270 

Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 2R0 oor- 



285 



Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
10 290 295 300 



His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 

305 ^ i n ->ti- 

J -' J 320 

15 Glu Gin Met Phe Thr Asp Pro Leu Gly He Asp Glu Tyr Gly Gly 

335 



325 330 



2b 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(1) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inea r 

(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Herpes Simplex Virus 
30 (B) STRAIN: K12, KOS 

(C) INDIVIDUAL ISOLATE: tTAg transact ivator 

(ix) FEATURE: 

(A) NAME / KEY : exon 
35 (B) LOCATION: 1 . . 894 

(ix) FEATURE: 

(A) NAME /KEY : mRNA 

(B) LOCATION: 1 . .8 94 

40 

(ix) FEATURE: 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 1 . .2 07 

45 (ix) FEATURE: 

(A) NAME /KEY : misc. binding 

(B) LOCATION: 208.. 297 

(ix) FEATURE: 
50 (A) NAME / KEY : CDS 

(B) LOCATION: 1 . .891 



(xi) SEQUENCE DESCRIPTION 

ATG TCT AGA TTA GAT AAA AGT AAA 
Met Ser Arg Leu Asp Lys Ser Lys 



SEQ ID NO : 3 : 

GTG ATT AAC AGC GCA TTA GAG CTG 
Val He Asn Ser Ala Leu Glu Leu 



- 3 / - 

1 5 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 
Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

5 AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 

10 50 55 60 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAC GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 
15 Asri Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 

85 90 95 



2 0 



CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 

115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cvs 
25 130 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 

145 150 ice 

3U 15 5 160 



30 



CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 19Q 

3 5 ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCT GAT 
He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Asp 
195 200 20 5 

CCA TCG ATA CAC ACG CGC AGA CTG TCG ACG GCC CCC CCG ACC GAT GT^ 
Pro Ser He His Thr Arg Arg Leu Ser Thr Ala Pro Pro Thr Asn Val 
40 210 215 220 



AGC CTG GGG GAC GAG CTC CAC TTA GAC GGC GAG GAC GTG GCG ATG G<~G 

Ser Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala 

225 230 235 240 

CAT GCC GAC GCG CTA GAC GAT TTC GAT CTG GAC ATG TTG GGG GAC GGG 



96 



144 



192 



240 



288 



GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 3 36 
Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 



384 



432 



480 



528 



576 



624 



67; 



720 



76 8 



10 



• .. # 

- ^ o - 

His Ala Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly 
245 250 255 

GAT TCC CCG GGT CCG GGA TTT ACC CCC CAC GAC TCC GCC CCC TAC GGC 
Asp Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly 
260 265 27Q 

GCT CTG GAT ATG GCC GAC TTC GAG TTT GAG CAG A'l'G TTT ACC GAT GCC 
Ala Leu Asp Met Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala 
275 280 285 

CTT GGA ATT GAC GAG TAC GGT GGG TTC TAG 
Leu Gly lie Asp Glu Tyr Gly Gly Phe 
290 295 



816 



864 



894 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 97 amino acids 

IS / n \ rnr, _ _ ■ 

^ \ o J i iffi : diiiino dciu 

( D ) TOPOLOGY: linear 

(n) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 
20 1 5 10 15 

Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 



25 



30 



40 



4 5 



Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 

35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 

50 55 60 



Asp Phe Leu Arg 



Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin 

65 70 75 80 

Asn Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 

35 85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 

!00 105 no 



Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 



130 135 



140 



50 



Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 

145 150 155 i 60 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 

165 170 i7 5 



- - 



Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 19Q 

lie He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Asp 
195 200 205 

Pro Ser lie His Thr Arg Arg Leu Ser Thr Ala Pro Pro Thr Asp Val 
210 215 220 

Ser Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala 

225 2 3 0 

235 240 

His Ala Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly 
15 245 250 255 

Asp Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly 

260 265 270 



10 



20 



2 5 



3 0 



3 5 



Ala Leu Asp Met Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala 
275 280 285 

Leu Gly He Asp Glu Tyr Gly Gly Phe 
290 295 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: K12 , Towne 

4 0 (ix) FEATURE : 

(A) NAME /KEY : mRNA 

(B) LOCATION: 382.. 450 



50 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


5 : 






GAATTCCTCG 


AGTTTACCAC 


TCCCTATCAG 


TGATAGAGAA 


AAGTGAAAGT 


CGAGTTTACC 


6 0 


ACTCCCTATC 


AGTGATAGAG 


AAAAGTGAAA 


GTCGAGTTTA 


CCACTCCCTA 


TCAGTGATAG 


12 0 


AGAAAAGTGA 


AAGTCGAGTT 


TACCACTCCC 


TATCAGTGAT 


AGAGAAAAGT 


GAAAGTCGAG 


18 0 


TTTACCACTC 


CCTATCAGTG 


ATAGAGAAAA 


GTGAAAGTCG 


AGTTTACCAC 


TCCCTATCAG 


24 0 


TGATAGAGAA 


AAGTGAAAGT 


CGAGTTTACC 


ACTCCCTATC 


AGTGATAGAG 


AAAAGTGAAA 


3 00 


GTCGAGCTCG 


GTACCCGGGT 


CGAGTAGGCG 


TGTACGGTGG 


GAGGCCTATA 


TAAGCAGAGC 


360 



TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 

AAGACACCGG GACCGATCCA GCCTCCGCGG 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
-> \A; LENGTH; 4 50 base pdizts 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: Towne 

15 

(ix) FEATURE : 

(A) NAME /KEY : mRNA 

v o j jjwu>\ ± _L wr* : joz . .450 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GAATTCCTCG ACCCGGGTAC C GAG C T CG AC TTTCACTTTT CTCTATCACT GATAGGGAGT 
GGTAAACTCG ACTTTCACTT TTCTCTATCA CTGATAGGGA GTGGTAAACT CGACTTTCAC 
TTTTCTCTAT CACTGATAGG GAGTGGTAAA CTCGACTTTC ACTTTTCTCT ATCACTGATA 
GGGAGTGGTA AACTCGACTT TCACTTTTCT CTATCACTGA TAGGGAGTGG TAAACTCGAC 

25 TTTCACTTTT CTCTATCACT GATAGGGAGT GGTAAACTCG ACTTTCACTT TTCTCTATCA 
CTGATAGGGA GTGGTAAACT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 
TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 
AAGACACCGG GACCGATCCA GCCTCCGCGG 
(2) INFORMATION FOR SEQ ID NO : 7 : 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

35 (ii) MOLECULE TYPE: DNA (genomic) 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Herpes Simplex Virus 



420 
450 



60 
120 
180 
240 
300 
360 
420 
450 



(B) STRAIN : KOS 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GAGCTCGACT TTCACTTTTC TCTATCACTG ATAGGGAGTG GTAAACTCGA CTTTCACTTT 



60 



- 61 

TCTCTATCAC TGATAGGGAG TGGTAAACTC GACTTTCACT TTTCTCTATC ACTGATAGGG 
AGTGGTAAAC TCGACTTTCA CTTTTCTCTA TCACTGATAG GGAGTGG TAA ACTCGACTTT 
CACTTTTCTC TATCACTGAT AGGGAGTGGT AAACTCGACT TTCACTTTTC TCTATCACTG 
ATAGGGAGTG GTAAACTCGA CTTTCACTTT TCTCTATCAC TGATAGGGAG TGGTAAACTC 
GAGATCCGGC GAATTCGAAC ACGCAGATGC AGTCGGGGCG GCGCGGTCCG AGGTCCACTT 
CGCATATTAA GGTGACGCGT GTGGCCTCGA ACACCGAG 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: Towne (hCMV) 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: pUHD BGR3 

20 



10 



25 



30 



120 

180 

240 

300 

360 

398 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO : 


8 : 






CTCGAGTTTA 


CCACTCCCTA 


TCAGTGATAG 


AGAAAAGTGA 


AAGTCGAGTT 


TACCACTCCC 


60 


TATCAGTGAT 


AGAGAAAAGT 


GAAAGTCGAG 


TTTACCACTC 


CCTATCAGTG 


ATAGAGAAAA 


120 


GTGAAAGTCG 


AGTTT AC C AC 


TCCCTATCAG 


TGATAGAGAA 


AAGTGAAAGT 


CGAGTTTACC 


180 


ACTCCCTATC 


AGTGATAGAG 


AAAAGTGAAA 


GTCGAGTTTA 


CCACTCCCTA 


TCAGTGATAG 


240 


AGAAAAGTGA 


AAGTCGAGTT 


TACCACTCCC 


TATCAGTGAT 


AGAGAAAAGT 


GAAAGTCGAG 


300 


CTCGGTACCC 


GGGTCGAGTA 


GGCGTGTACG 


GTGGGAGGCC 


TATATAAGCA 


GAGCTCGTTT 


360 


AGTGAACCGT 


CAGATCGCCT 


GGAGACGCCA 


TCCACGCTGT 


TTTGACCTCC 


ATAGAAGACA 


420 


CCGGGACCGA 


TCCAGCCTCC 


GCGGCCCCGA 


ATTCGAGCTC 


GGTACCGGGC 


CCCCCCTCGA 


480 


GGTCGACGGT 


ATCGATAAGC 


TTGATATCGA 


ATTCCAGGAG 


GTGG AG AT C C 


GCGGGTCCAG 


540 


CCAAACCCCA 


CACCCATTTT 


CTCCTCCCTC 


TGCCCCTATA 


TCCCGGCACC 


CCCTCCTCCT 


6 00 


AGCCCTTTCC 


CTCCTCCCGA 


GAGACGGGGG 


AGGAGAAAAG 


GGGAGTTCAG 


GTCGACATGA 


66 0 


CTGAGCTGAA 


GGCAAAGGAA 


CCTCGGGCTC 


CCCACGTGGC 


GGGCGGCGCG 


CCCTCOCCCA 


72 0 


CCGAGGTCGG 


ATCCCAGCTC 


CTGGGTCGCC 


CGGACCCTGG 


CCCCTTCCAG 


GGGAGCCAGA 


780 


CCTCAGAGGC 


CTCGTCTGTA 


GTCTCCGCCA 


TCCCCATCTC 


CCTGGACGGG 


TTGCTCTTCC 


84 0 



10 



15 



20 



25 



30 



- <S? . 

CCCGGCCCTG TCAGGGGCAG AACCCCCCAG ACGGGAAGAC GCAGGACCCA CCGTCGTTGT 
CAGACGTGGA GGGCGCATTT CCTGGAGTCG AAGCCCCGGA GGGGGCAGGA GACAGCAGCT 
CGAGACCTCC AGAAAAGGAC AGCGGCCTGC TGGACAGTGT CCTCGACACG CTCCTGGCGC 
CCTCGGGTCC CGGGCAGAGC CACGCCAGCC CTGCCACCTG CGAGGCCATC AGCCCGTGGT 
GCCTGTTTGG CCCCGACCTT CCCGAAGACC CCCGGGCTGC CCCCGCTACC AAAGGGGTGT 
TGGCCCCGCT CATGAGCCGA CCCGAGGACA AGGCAGGCGA CAGCTCTGGG ACGGCAG CGG 
CCCACAAGGT GCTGCCCAGG GGACTGTCAC CATCCAGGCA GCTGCTGCTC CCCTCCTCTG 
GGAGCCCTCA CTGGCCGGCA GTGAAGCCAT CCCCGCAGCC CGCTGCGGTG CAGGTAGACG 
AGGAGGACAG CTCCGAATCC GAGGGCACCG TGGGCCCGCT CCTGAAGGGC CAACCTCGGG 
CACTGGGAGG CACGGCGGCC GGAGGAGGAG CTGCCCCCGT CGCGTCTGGA GCGGCCGCAG 
GAGGCGTCGC CCTTGTCCCC AAGGAAGATT CTCGCTTCTC GGCGCCCAGG GTCTCCTTGG 
CGGAGCAGGA CGCGCCGGTG GCGCCTGGGC GCTCCCCGCT GGCCACCTCG GTGGTGGATT 
TCATCCACGT GCCCATCCTG CCTCTCAACC ACGCTTTCCT GGCCACCCGC ACCAGGCAGC 
TGCTGGAGGG GGAGAGCTAC GACGGCGGGG CCGCGGCCGC CAGCCCCTTC GTCCCGCAGC 
GGGGCTCCCC CTCTGCCTCG TCCACCCCTG TGGCGGGCGG CGACTTCCCC GACTGCACCT 
ACCCGCCCGA CGCCGAGCCC AAAGATGACG CGTTCCCCCT CTACGGCGAC TTCCAGCCGC 
CCGCCCTCAA GATAAAGGAG GAGGAAGAAG CCGCCGAGGC CGCGGCGCGC TCCCCGCGTA 
CGTACCTGGT GGCTGGTGCA AACCCCGCCG CCTTCCCGGA CTTCCAGCTG GCAGCGCCGC 
CGCCACCCTC GCTGCCGCCT CGAGTGCCCT CGTCCAGACC CGGGGAAGCG GCGGTGGCGG 
CCTCCCCAGG CAGTGCCTCC GTCTCCTCCT CGTCCTCGTC GGGGTCGACC CTGGAGTGCA 
TCCTGTACAA GGCAGAAGGC GCGCCGCCCC AGCAGGGCCC CTTCGCGCCG CTGCCCTGCA 
AGCCTCCGGG CGCCGGCGCC TGCCTGCTCC CGCGGGACGG CCTGCCCTCC ACCTCCGCCT 
CGGGCGCAGC CGCCGGGGCC GCCCCTGCGC TCTACCCGAC GCTCGGCCTC AACGGACTCC 
CGCAACTCGG CTACCAGGCC GCCGTGCTCA AGGAGGGCCT GCCGCAGGTC TACACGCCCT 
ATCTCAACTA CCTGAGGCCG GATTCAGAAG CCAGTCAGAG CCCACAGTAC AGCTTCGAGT 
CACTACCTCA GAAGATTTGT TTGATCTGTG GGGATGAAGC ATCAGGCTGT CATTATGGTG 
TCCTCACCTG TGGGAGCTGT AAGGTCTTCT TTAAAAGGGC AATGGAAGGG CAGCATAACT 
ATTTATGTGC TGGAAGAAAT GACTGCATTG TTGATAAAAT CCGCAGGAAA AACTGCCCGG 
CGTGTCGCCT TAGAAAGTGC TGTCAAGCTG GCATGGTCCT TGGAGGGCGA AAGTTTAAAA 
AGTTCAATAA AGTCAGAGTC ATGAGAGCAC TCGATGCTGT TGCTCTCCCA CAGCCAGTGG 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
22 8 0 
2 34 0 
2400 
2460 
2 52 0 
2 580 
26 4 0 
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GCATTCCAAA TGAAAGCCAA CGAATCACTT TTTCTCCAAG TCAAGAGATA CAGTTAATTC 
CCCCTCTAAT CAACCTGTTA ATGAGCATTG AACCAGATGT GATCTATGCA GGACATGACA 
ACACAAAGCC TGATACCTCC AGTTCTTTGC TGACGAGTCT TAATCAACTA GGCGAGCGGC 
AACTTCTTTC AGTGGTAAAA TGGTCCAAAT CTCTTCCAGG TTTTCGAAAC TTACATATTG 
5 ATGACCAGAT AACTCTCATC CAGTATTCTT GGATGAGTTT AATGGTATTT GGACTAGGAT 
GGAGATCCTA CAAACATGTC AGTGGGCAGA TGCTGTATTT TGCACCTGAT CTAATATTAA 
ATGAACAGCG GATGAAAGAA TCATCATTCT ATTCACTATG CCTTACCATG TGGCAGATAC 
CGCAGGAGTT TGTCAAGCTT CAAGTTAGCC AAGAAGAGTT CCTCTGCATG AAAGTATTAC 
TACTTCTTAA TACAATTCCT TTGGAAGGAC TAAGAAGTCA AAGCCAGTTT GAAGAGATGA 
10 GATCAAGCTA CATTAGAGAG CTCATCAAGG CAATTGGTTT GAGGCAAAAA GGAGTTGTTT 
CCAGCTCACA GCGTTTCTAT CAGCTCACAA AACTTCTTGA TAACTTGCAT GATCTTGTCA 
AACAACTTCA CCTGTACTGC CTGAATACAT TTATCCAGTC CCGGGCGCTG AGTGTTGAAT 
TTCCAGAAAT GATGTCTGAA GTTATTGCTG CACAGTTACC CAAGATATTG GCAGGGATGG 
TGAAACCACT TCTCTTTCAT AAAAAGTGAA TGTCAATTAT TTTTCAAAGA ATTAAGTGTT 
15 GTGGTATGTC TTTCGTTTTG GTCAGGATTA TGACGTCTCG AGTTTTTATA ATATTCTGAA 
AGGGAATTCC TGCAGCCCGG GGGATCCACT AGTTCTAGAG GATCCAGACA TGATAAGATA 
CATTGATGAG TTTGGACAAA CCACAACTAG AATGCAGTGA AAAAAATGCT TTATTTGTGA 
AATTTGTGAT GCTATTGCTT TATTTGTAAC CATTATAAGC TGCAATAAAC AAGTTAACAA 
CAACAATTGC ATTCATTTTA TGTTTCAGGT TCAGGGGGAG GTGTGGGAGG TTTTTTAAAG 
2 0 CAAGTAAAAC CTCTACAAAT GTGGTATGGC TGATTATGAT CCTGCAAGCC TCGTCGTCTG 
GCCGGACCAC GCTATCTGTG CAAGGTCCCC GGACGCGCGC TCCATGAGCA GAGCGCCCGC 
CGCCGAGGCA AGACTCGGGC GGCGCCCTGC CCGTCCCACC AGGTCAACAG GCGGTAACCG 
GCCTCTTCAT CGGGAATGCG CGCGACCTTC AGCATCGCCG GCATGTCCCC TGGCGGACGG 
GAAGTATCAG CTCGACCAAG CTTGGCGAGA TTTTCAGGAG CTAAGGAAGC TAAAATGGAG 
AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA AGAACATTTT 
GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GCATTAATGA 
ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTP 
ACTGACTCGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG 
GTAATACGGT TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAAGATGTG AG C AAAAGG C 
CAGCAAAAGG CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC 
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30 



2700 

2760 

2820 

2880 

2940 

3000 

3060 

3 12 0 

3 18 0 

3240 

3300 

3360 

3420 

3480 

3540 

36 00 

36 6 0 

3 72 0 

3780 

3840 

3 9 0 0 

3 96 0 
4020 

4 08 0 
4140 
4 2 0 0 
4 26 0 
4 3 2 0 
4 3 8 0 
444 C 
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CCCCCTGACG AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 
CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC 
CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA 
TGCTCACGCT GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG 
CACGAACCCC CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC 
AACCCGGTAA GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 
GCGAGGTATG TAGGCGG TGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT 
AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT 
GGTAGCTCTT GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG 
CAGCAGATTA CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG 
TCTGACGCTC AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 
AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA 
TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG 
ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA 
CGGGAGGGCT TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG 
GCTCCAGATT TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT 
GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT 
TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC 
TCGTCGTTTG GTATGGCTTC ATTCAGCTC C GGTTCCCAAC GATCAAGGCG AGTTACATGA 
TCCCCCATGT TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT 
AAGTTGGCCG CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 
ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA 
TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA 
CATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA 
AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT 
TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 
GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA 
TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG GAT AC AT ATT TGAATGTATT 
TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC 
TAAGAAACCA TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT 



4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5*3 4 0 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

58 2 0 

58 8 0 

5 94 0 

6 000 
6 0 6 0 
6 12 0 
6180 
6240 




CGTC 

6244 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4963 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pUHD BGR4 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


9 : 








\_v_A^, i i A 


i L-A^ i. (jA i A(^ 


AUAAAAUTGA 


AAGTCGAGTT 


TACCACTCCC 


60 


TATCAGTGAT 


AGAGAAAAGT 


GAAAGTCGAG 


TTTACCACTC 


CCTATCAGTG 


ATAGAGAAAA 


120 


GTGAAAGTCG 


AGTTTACCAC 


TCCCTATCAG 


TGATAGAGAA 


AAGTGAAAGT 


CGAGTTTACC 


180 


ACTCCCTATC 


AGTGATAGAG 


AAAAGTGAAA 


GTCGAGTTTA 


CCACTCCCTA 


TCAGTGATAG 


24 0 


AGAAAAGTGA 


AAGTCGAGTT 


TACCACTCCC 


TATCAGTGAT 


AGAGAAAAGT 


GAAAGTCGAG 


300 


CTCGGTACCC 


GGGTCGAGTA 


GGCGTGTACG 


GTGGGAGGCC 


TATATAAGCA 


GAGCTCGTTT 


360 


AGTGAACCGT 


CAGATCGCCT 


GGAGACGCCA 


TCCACGCTGT 


TTTGACCTCC 


ATAGAAGACA 


420 


CCGGGACCGA 


TCCAGCCTCC 


GCGGCCCCGA 


ATTCCGGCCA 


CGACCATGAC 


CATGACCCTC 


480 


CACACCAAAG 


CATCTGGGAT 


GGCCCTACTG 


CATCAGATCC 


AAGGGAACGA 


GCTGGAGCCC 


540 


CTGAACCGTC 


CGCAGCTCAA 


GATCCCCCTG 


GAGCGGCCCC 


TGGGCGAGGT 


GTACCTGGAC 


6 00 


AGCAGCAAGC 


CCGCCGTGTA 


CAACTACCCC 


GAGGGCGCCG 


CCTACGAGTT 


CAACGCCGCG 


660 


GCCGCCGCCA 


ACGCGCAGGT 


CTACGGTCAG 


ACCGGCCTCC 


CCTACGGCCC 


CGGGTCTGAG 


720 


GCTGCGGCGT 


TCGGCTCCAA 


CGGCCTGGGG 


GGTTTCCCCC 


CACTCAACAG 


CGTGTCTCCG 


"8 0 


AGCCCGCTGA 


TGCTACTGCA 


CCCGCCGCCG 


CAGCTGTCGC 


CTTTCCTGCA 


GCCCCACGGC 


84 0 


CAGCAGGTGC 


CCTACTACCT 


GGAGAACGAG 


CCCAGCGGCT 


ACACGGTGCG 


CGAGGCCGGC 


900 


CCGCCGGCAT 


TCTACAGGCC 


AAATTCAGAT 


AATCGACGCC 


AGGGTGGCAG 


AGAAAGATTG 


96 0 


GCCAGTACCA 


ATGACAAGGG 


AAGTATGGCT 


ATGGAATCTG 


CCAAGGAGAC 


TCGCTACTGT 


1020 


GCAGTGTGCA 


ATGACTATGC 


TTCAGGCTAC 


CATTATGGAG 


TCTGGTCCTG 


TGAGGGCTGC 


10 8 0 


AAGGCCTTCT 


TCAAGAGAAG 


TATTCAAGGA 


CATAACGACT 


ATATGTGTCC 


AGCCACCAAC 


114 0 


CAGTGCACCA 


TTGATAAAAA 


CAGGAGGAAG 


AGCTGCCAGG 


CCTGCCGGCT 


CCGCAAATGC 


12 00 
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TACGAAGTGG GAATGATGAA AGGTGGGATA CGAAAAGACC GAAGAGGAGG GAGAATGTTG 
AAA C AC AAG C G C C AG AG AG A TGA TGGGGAG GGCAGGGGTG AAGTGGGGTC TGCTGGAGAC 
ATGAGAGCTG CCAACCTTTG GCCAAGCCCG CTCATGATCA AACGCTCTAA G AAG AAC AG C 
CTGGCCTTGT CCCTGACGGC CGACCAGATG GTCATGGCCT TGTTGGATGC TGAGCCCCCC 
ATACTCTATT CCGAGTATGA TCCTACCAGA CCCTTCAGTG AAGCTTCGAT GATGGGCTTA 
CTGACCAACC TGGCAGACAG GGAGCTGGTT CACATGATCA ACTGGGCGAA GAGGGTGCCA 
GGCTTTGTGG ATTTGACCCT CCATGATCAG GTCCACCTTC TAGAATGTGC CTGGCTAGAG 
ATCCTGATGA TTGGTCTCGT CTGGCGCTCC ATGGAGCACC CAGTGAAGCT ACTGTTTGCT 
CCTAACTTGC TCTTGGACAG GAACCAGGGA AAATGTGTAG AGGGCATGGT GGAGATCTTC 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 



30 



GACATGCTGC 


TGGCTACATC 


ATCTCGGTTC 


CGCATGATGA 


ATCTGCAGGG 


AGAGGAGTTT 


1800 


GTGTGCCTCA 


AATCTATTAT 


TTTGCTTAAT 


TCTGGAGTGT 


AC AC AT T T r *'~ r ' 


CTCCAGCACC 


18 6 0 


CTGAAGTCTC 


TGGAAGAGAA 


GGACCATATC 


CACCGAGTCC 


TGGACAAGAT 


CACAGACACT 


1920 


TTGATCCACC 


TGATGGCCAA 


GGCAGGCCTG 


ACCCTGCAGC 


AGCAGCACCA 


GCGGCTGGCC 


1980 


CAGCTCCTCC 


TCATCCTCTC 


CCACATCAGG 


CACATGAGTA 


ACAAAGGCAT 


GGAGCATCTG 


2040 


TACAGCATGA 


AGTGCAAGAA 


CGTGGTGCCC 


CTCTATGACC 


TGCTGCTGGA 


GATGCTGGAC 


2100 


GCCCACCGCC 


TACATGCGCC 


CACTAGCCGT 


GG AGGGG CAT 


CCGTGGAGGA 


GACGGACCAA 


2160 


AGCCACTTGG 


CCACTGCGGG 


CTCTACTTCA 


TCGCATTCCT 


TGCAAAAGTA 


TTACATCACG 


2220 


GGGGAGGCAG 


AGGGTTTCCC 


TGCCACAGTC 


TGAGAGCTCC 


CTGGCGGAAT 


TCGAGCTCGG 


2280 


TACCCGGGGA 


TCCTCTAGAG 


GATCCAGACA 


TGATAAGATA 


CATTGATGAG 


TTTGGACAAA 


2340 


CCACAACTAG 


AATGCAGTGA 


AAAAAATGCT 


TTATTTGTGA 


AATTTGTGAT 


GCTATTGCTT 


2400 


TATTTGTAAC 


CATTATAAGC 


TGCAATAAAC 


AAGTTAACAA 


CAACAATTGC 


ATTCATTTTA 


2460 


TGTTTCAGGT 


TCAGGGGGAG 


GTGTGGGAGG 


TTTTTTAAAG 


CAAGTAAAAC 


CTCTACAAAT 


2520 


GTGGTATGGC 


TGATTATGAT 


CCTGCAAGCC 


TCGTCGTCTG 


GCCGGACCAC 


GCTATCTGTG 


2 58 0 


CAAGGTCCCC 


GGACGCGCGC 


TCCATGAGCA 


GAGCGCCCGC 


CGCCGAGGCA 


AGACTCGGGC 


2 6 4 0 


GGCGCCCTGC 


CCGTCCCACC 


AGGTCAACAG 


GCGGTAACCG 


GCCTCTTCAT 


CGGGAATGCG 


2 700 


CGCGACCTTC 


AGCATCGCCG 


GCATGTCCCC 


TGGCGGACGG 


GAAGTATCAG 


CTCGACCAAG 


2 76 0 


CTTGGCGAGA 


TTTTCAGGAG 


CTAAGGAAGC 


TAAAATGGAG 


AAAAAAATCA 


CTGGATATAC 


2 82 0 


CACCGTTGAT 


ATATCCCAAT 


GGCATCGTAA 


AGAACATTTT 


GAGGCATTTC 


AGTCAGTTGC 


2880 


TCAATGTACC 


TATAACCAGA 


CCGTTCAGCT 


GCATTAATGA 


ATCGGCCAAC 


GCGCGGGGAG 


2940 


AGGCGGTTTG 


CGTATTGGGC 


GCTCTTCCGC 


TTCCTCGCTC 


ACTGACTCGC 


TGCGCTCGGT 


3000 
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CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA 
ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 
TAAAAAGGCC GCGTTGCTGG CGTTTTTC C A TAGGCTCCGC CCCCCTGACG AGCATCACAA 
AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT 
TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT 
GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCACGCT GTAGGTATCT 
CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 
CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT 
ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC 
TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG TATTTGGTAT 
CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT RfiTJfiTTrrr 
ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 
AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA 
AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT 
TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA CTTGGTCTGA 
CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC 
CATAGTTGCC TGATCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC 
CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT AT C AG C AAT A 
AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC CGCCTCCATC 
CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 
AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA 
TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA 
GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA 
CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT 
TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG GCGACCGAGT 
TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG 
CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACCGCTGTTGAGA 
TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 
AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG AATAAGGGCG 
ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG CATTTATCAG 



3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3 660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4 56 0 

4 620 

4680 

4740 

4800 



GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG 
GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT TATTATCATG 
ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTTTC GTC 

5 (2) INFORMATION FOR SEQ ID NO : 1 0 : 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 42 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TCGAGTTTAC CACTCCCTAT CAGTGATAGA GAAAAGTGAA AG 



4860 
4920 
4963 



42 



